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Abstract 

J. Hopcroft and D. Sheldon originally introduced network reputation 
games to investigate the self-interested behavior of web authors who want 
to maximize their PageRank on a directed web graph by choosing their 
outlinks in a game theoretic manner. They give best response strategies 
for each player and characterize properties of web graphs which are Nash 
equilibria. In this paper we consider three different models for PageRank 
games on undirected graphs such as certain social networks. In undirected 
graphs players may delete links at will, but typically cannot add links 
without the other player's permission. In the deletion-model players are 
free to delete any of their bidirectional links but may not add links. We 
study the problem of determining whether the given graph represents a 
Nash equilibrium or not in this model. We give an 0(n 2 ) time algorithm 
for a tree, and a parametric 0(2 k n 4 ) time algorithm for general graphs, 
where k is the maximum vertex degree in any biconnected component 
of the graph. In the request- delete-model players are free to delete any 
bidirectional links and add any directed links, since these additions can 
be done unilaterally and can be viewed as requests for bidirected links. For 
this model we give an 0(n 3 ) time algorithm for verifying Nash equilibria in 
trees. Finally, in the add- delete-model we allow a node to make arbitrary 
deletions and the addition of a single bidirectional link if it would increase 
the page rank of the other player also. In this model we give a parametric 
algorithm for verifying Nash equilibria in general graphs and characterize 
so called a-insensitive Nash Equilibria. We also give a result showing 
a large class of graphs where there is an edge addition that causes the 
PageRank of both of its endpoints to increase, suggesting convergence 
towards complete subgraphs. 
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1 Introduction 



Introduced by Larry Page and Sergey Brin [T2], the PageRank of a web page 
is an important basis of the Google search engine and possibly one of the most 
successful applications of a mathematical concept in the IT world. PageRank is 
a value that is assigned to each web page according to the stationary distribution 
of an a-random walk on the web graph. Here an a-random walk is a random 
walk modified to make a random jump with probability a at each step and a 
random jump is a move to a node according to a given distribution vector q. 

Unlike rankings based on content such as keywords, tags, etc., PageRank 
focuses solely on the hyperlink structure of the given web graph. Web links 
themselves possess strategic worth and hence web authors often try to boost 
the PageRank of their web pages by carefully choosing links to other pages. 
Since these authors behave strategically in a self-interested way, this is a typical 
example of a non-cooperative game. In fact, Hopcroft and Sheldon recently 
introduced the PageRank game as a game theoretic model played over a directed 
graph 6J . Each player is identified with a node, and a strategy is a specification 
of a set of outlinks to other nodes. The payoff for each player is the PageRank 
value for their node which is calculated on the resulting directed graph. The 
obvious goal of each player is to maximize their payoff. 

In [6], the authors proved a nice property of this game, namely the best 
strategy of a player v is to place her outlinks to the nodes u having largest 
potential value <puv ■ The potential (f> uv measures the probability of returning to 
v before the first jump and does not depend on the outlinks from v if the other 
nodes do not change their outlinks. Thus, a simple greedy algorithm exists for 
deciding if a given graph is in Nash equilibrium and a nice characterization of 
Nash equilibria graphs is possible. Interestingly, it turns out that such graphs 
representing Nash equilibria have very strong regularity properties (see Section[3] 
for details) . The purpose of this paper is to study similar problems on undirected 
graphs. 

Motivation. Social networks have become one of the defining paradigms 
of our time, with enormous influence on how decisions are taken and events 
unfold. As with web graphs, content by itself will rarely be enough to explain the 
dynamics of these networks. The underlying graph structure itself surely plays a 
role in how new relations are formed and old relations broken. In considering two 
major social networks, Facebook and Twitter, a casual glance shows a radically 
different graph structure in spite of the fact that they have a comparable number 
of similar users. Facebook, an undirected graph, has few nodes with degree more 
than a 1000. Twitter, a directed graph, has nodes (such as Kate Perry) with 
in-degree 28 million and out degree just 115. A basic difference in the dynamics 
of the two networks is edge addition, which requires the approval of both nodes 
in an undirected graph but does not in a directed graph. The ability to add 
and delete links instantly in a directed network allows for an extremely rapid 
dynamically changing graph structure. Anecdotal evidence points to a much 
more stable graph structure in Facebook which apparently consists of large 
number of relatively small very dense subgraphs. 

Another example of an undirected network is the graph of international bi- 
lateral agreements between universities. We might consider PageRank as mea- 
suring how prestigious a university is, and universities might only accept agree- 
ments if it increases their prestige. Finally we might consider the coauthorship 
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graph, possibly one of the oldest social networks, defined so that people could 
find their "Erdos number". Here edge deletions are not permitted, but edge 
additions could conceivably be influenced by the PageRank of the given nodes. 

Our motivation is to build models for undirected graphs and to study their 
dynamics. Our basic tool will be to adopt PageRank as a quantity that users 
try to optimize. Under this assumption we will study how undirected networks 
evolve, what networks in equilibrium look like, and contrast this to the case of 
directed networks. Whether users of these networks actually behave in this way 
is beyond the scope of this paper. 

Outline of the paper. We introduce three different models for PageRank 
games on undirected graphs. Our study mainly focuses on the deletion-model 
(described in Section [3]) where a player cannot create a new link but may unilat- 
erally delete an existing one. In the directed web graph model described above, 
what v intuitively does is to cut its links to nodes u having smaller <p uv values, 
assuming that u will not delete its edge to v. In the deletion-model, if v cuts 
its outlink to u we assume that u also cuts its outlink to v either automatically, 
or as a form of revenge. In Figure [l] deleting edge ae increases e's PageRank 
but decreases a's. So e may unilaterally choose to do this. Note after the edge 
deletion, if a proposed to reinstate the edge e would refuse. 

Unlike the directed graph model, in the undirected model a node cannot 
add a new edge by acting unilaterally, but must seek the permission of the 
other node of the edge. It turns out that the class of equilibria graphs in the 
deletion-model is larger than in the directed model. Unfortunately, the nice 
property of the original model that <fi uv does not depend on the outlinks from 
v, no longer holds. Hence the greedy algorithm for the Nash decision problem 
does not work, either, and there seems to be no obvious way of checking the 
equilibrium condition. 

In Section 3.1 we give an 0(n 2 ) time algorithm for the case where the graph 
is a tree. In Section 3.2 we gave a parametric algorithm for general graphs, where 
the parameter k is the maximum degree of any vertex in any biconnected com- 
ponent that contains it. Biconnected components roughly correspond to local 
clusters of web pages, where one could expect the parameter k to be relatively 
small. Nodes linking biconnected clusters may have arbitrarily large degree 
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Figure 2: Edge addition 



without changing the time complexity. We give an 0(2 k n 4 ) time algorithm for 
general graphs. 

Our second model is the request- delete-model where a player can unilater- 
ally delete any existing edges and also can unilaterally create any new directed 
outlinks but cannot create a new inlink. In Section [4j we give an 0(n 3 ) time 
algorithm for trees which determines if the given graph is a Nash equilibrium 
in request-delete-models. It draws on the algorithm in Section [3. 1| 

The third model is called the add- delete-model where a player can delete 
any existing edges and also can add an undirected edge to another player if the 
PagcRank of both players are improved. In Figure [2] adding edge bf increases 
both 6's and /'s PageRank, so both parties would accept this addition. Note 
the PageRank of all other players decreases. 

While it may seem a great restriction to consider single edge additions, 
recall that we are interested in Nash equilibria. Multiple edge additions require 
the simultaneous decisions of multiple players. A player in this group cannot 
know the actions of the other players and hence cannot predict the new graph 
structure. Therefore it is not possible in general for a player to calculate whether 
or not an edge addition would improve her PageRank. In Section [5j we give an 
0(2 2k n 5 ) time algorithm for general graphs which determines if the given graph 
is a Nash equilibrium in add-delete-models. It draws on the algorithm in Section 
|3.2| We also give two structural type theorems. The first shows that the only 
a-insensitive equilibria are complete graphs. The second says that in symmetric 
graphs edge-addition will occur. This gives some theoretical justification for the 
anecdotal evidence cited earlier with respect to Facebook. 

Our results begin with the study of trees. This is not because social networks 
are likely to be trees but because the more complex parametric algorithms for 
general graphs are based on these results. 

Related work. Although it focuses less on game theoretic aspects, there 
is a large literature on optimal linking strategies to maximize the PageRank 
of given nodes for directed graphs. On the positive side, Avrachenkov and 
Litvak [2] give a polynomial-time algorithm for maximizing the PageRank of a 
single node by selecting its outlinks. Kerchove et. al. [7] extend this result to 
maximizing the sum of the PageRank of a given set of nodes. Csaji et. al. [4 
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give a polynomial-time algorithm for maximizing the PageRank of a single node 
with any given set of controllable links. 

On the negative side, [4] also shows that the problem becomes NP-hard if 
some pairs of controllable links in the set are mutually exclusive. Olsen [9] 
proved that maximizing the minimum PageRank in the given set of nodes is 
NP-hard if we are allowed to add k new links. He also proved that the problem 
is still NP-hard if we restrict the node set to a single node and the k links to 
only incoming ones to that node |10j and gives a constant factor approximation 
algorithm for this problem The question of whether there are a-sensitive 
Nash equilibria was recently affirmatively answered by Chen et. al. [3J. 

This paper is an extended version of an earlier paper presented at ISAAC 
2011 PP, which only covered the edge deletion model. 



2 Preliminaries 

2.1 PageRank values 

Initially we describe the Hopcroft-Sheldon directed graph model. Let D = 
(V, E') be a simple directed graph on node set V and arc set E' , and let q be a 
probability distribution on V. Throughout the paper we let n — \V\ denote the 
number of nodes. For v £ V, let T(v) denote the set of v's out-neighbours. A 
random jump is a move to a node according to the distribution vector q instead 
of using one of the outlinks of the current node. An a-random walk on D is a 
random walk that is modified to make a random jump with fixed probability 
a (0 < a < 1) at each step. The PageRank vector 7r over the n vertices in V 
is defined as the stationary distribution of the a-random walk. We define the 
potential matrix $ = (4> uv ) such that for vertices u, v € V, 4>uv is the probability 
that a random walk that starts from u visits v before the first random jump 
(4>uv — 1 ii u = v), which can be written as 

4>uv = t^*- V (f>iv (1) 

i£l (u) 



In order to calculate 7r, we have the following equation [6]: 

i _ (Lrg) v th- 
1 |i»| l^ier(v) Ptv 

Chen et. al. proved that ir v is continuous for a € (0, 1) [3]. 



(2) 



2.2 Directed PageRank games 

In the PageRank games in [5] the players are the nodes V of a directed graph 
D and they attempt to optimize their PageRank by strategic link placement. 
A strategy for node v is a set of outlinks. An outcome is an arc set E' for D 
consisting of the outlinks chosen by each player. The payoff of each player is 
the value of PageRank which is calculated on D. 

We say a player v is in best response, if v takes a strategy which maximizes v's 
PageRank in D. A directed graph D is a Nash equilibrium if the set of outlinks 
for each node is a best response: no player can increase her PageRank value by 
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choosing different outlinks. Several results for best response strategies and for 
Nash equilibria were introduced in [5] . In particular they gave a characterization 
of a-insensitive Nash equilibria, which are graphs being Nash equilibria for all 
values < a < 1 of the jump parameter. 

In this paper we study PageRank games for undirected graphs. Let G = 
(V, E) be an undirected graph on vertex set V and edge set E. Define the 
directed graph D = (V, E') on the same vertex set V, where each edge uv in E 
gives rise to two arcs uv and vu in E' . In our model, the payoff of each player 
v for the graph G is the PageRank of v in the corresponding directed graph D. 



3 Deletion Only Models 

In this section, we study the deletion-model for undirected PageRank games, 
where a player cannot unilaterally create a bidirected link to another node, but 
it can delete a bidirectional link. We consider the problem that determines 
whether the given graph is a Nash equilibrium in deletion-model or not. In 
Section [XT] we give a quadratic algorithm for the special case when G is a tree, 



and in Section 3.2 we give an 0(2 fe n 4 )time algorithm for general graphs where 
k is the maximum vertex degree on any biconnected component on G. 

In the deletion-model, we say that a player v is in best response if v cannot 
increase her PageRank by any deletion of her (bidirectional) links. A Nash 
equilibrium is a graph for which every player is in best response. We consider 
the following problem: 

Input: An undirected graph G, a, q. 

Output: Is the input a Nash equilibrium? (yes/no) 

An equivalent formulation is to decide whether no player can increase her 
PageRank for the given input, where she is only allowed to delete edges to her 
neighbours. As for directed graphs, we let T(v) denote the neighbours of vertex 
v inG. A strategy for v is to retain a subset E v C T(v) of neighbours and delete 
edges to her other neighbours. Let a; be a 0/1 vector of length d v = \T(v)\, which 
indicates u's strategy. Formally, if i £ E v then x% = 1, otherwise Xi = 0, for 
i = l,...,d v . Let (f> uv (x) denote the potential function ([I]) for the subgraph 
of G formed by deleting edges (v,i), i € T(v) — E v . By Q applied to the 
corresponding directed graph D the PageRank of v can be written as 

Kv ( x )=a Qu<t>uv(x) ^ 

l-(l-a) _ 

where x ^ 0. Let \ m denote a vector of ones of length m. Usually the length is 
clear by the context so for simplicity we may drop the subscript m. If the input 
is a Nash equilibrium then v is using a best response and no edge deletions for 
v will raise her PageRank. Therefore Tr v (ld v ) > tt v (x) for any 0/1 vector x. 
The approach we will use to solve the problem described in this section is to 
compute the maximum of tt v (x) over all 0/1 vectors x of length d v , for each 
vertex v. 

We give some examples in Figure § Graphs (a), (b) are a-insensitive Nash 
equilibria in directed PageRank games, and are also a Nash equilibrium in 



6 



deletion-models for any given a and q. Graph (c) is an example which is not 
a Nash equilibrium in directed games, since v can increase its PageRank if we 
delete the arc from v to 2 which has less potential than 1. However (c) is a 
Nash equilibrium in deletion-models for a — 0.15 and uniform distribution q, 
since if v cuts its edge to 2 then it decreases v's PageRank. Graph (d) is not a 
Nash equilibrium in both directed models and deletion-models for a — 0.15 and 
uniform distribution q, where K m is a m-complete graph, m = 8 for example. 
In this graph the potentials from 2, 3 to v are much less than 1, so v may try 
to cut the edge to 2 or the edge to 3, but a single edge deletion decreases v's 
PageRank. Interestingly, the deletion of both edges leads to a greater PageRank 
for v. 




(a) (b) (c) (d) 



Figure 3: Examples 



3.1 Trees 

In this section, we study the problem where the graph G is a tree. We prove 
the following theorem. 

Theorem 3.1 Given a tree G, jump probability a and distribution q, we can 
determine in 0{n 2 ) time whether G is a Nash equilibrium in the deletion-model 
and if not give an improving strategy for at least one player. 

The remainder of this section is devoted to the proof of this theorem. 

Let v be a node in G, let T(v) be the set of neighbours of v, and let d v = 
\T(v)\. Consider any strategy for v as described above and let x be the 0/1 
vector that represents it. For i £ T(v), let Ni be the set of nodes which are 
descendants of i (including i itself) in the subtree of G rooted at v. For a node 
u £ N h 

4> uv {x) = (j) uv Xi, (4) 

since potentials of all nodes in Ni depend on only link (v, i) and the other links 
of v do not affect these potentials. This is because if v cuts link (v, i), all nodes 
in Ni are disconnected from the other nodes in G. 
Therefore (|3) can be rewritten: 

/ \ Sier(u) J2ueNi Qu^uv x i 
TT v (x)=a — — ^ . (5) 

l-(l-a) _ 

Note that the potential matrix $ on G can be computed in 0(n 2 ) time, by using 
Gaussian elimination methods for each column vector defined by equation 
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([T]). Since G is a tree we can apply elimination steps in post-order, where 
we consider v to be the root of G. There are at most n forward eliminations 
and backward substitutions because every node except v has only one parent. 
Therefore it costs 0(n 2 ) time to compute <£>. 

Let dj = &J2 U £N Qu4>uv and let 6 ?; = 1 — (1 — a)4>i V for i £ T(v). Consider 
the fractional integer programming problem, 

P : maximize n v (x) = l T x — , x € {0, 1}™ 

b 1 x 

where > and bi > for i £ r(f) are known constants. 

In order to solve problem P, we fix the Hamming weight I of x and we solve 
the following problem for each I = 1, d v : 

Q : maximize f{x) = —= — subject to l T x = I. 
b 1 x 

Problem Q can be solved directly by Megiddo's method in 0(n 2 log 2 n) time [5], 
and it can be also solved by Newton's Method in time 0(n 2 logn) [T2]. However 
we are able to specialize Megiddo's method for our problem to obtain an 0(n 2 ) 
time algorithm. Our approach initially follows the technique describe in [8] . 

Since max7r„(cc) = max; / f{x), we can solve problem P by solving problem 
Q for each I = 1, d v . Consider the following maximization problem for some 
fixed 6. 

R : maximize g(x) = (a — bS) T x subject to l T x = I. 

Let Cj = Oj — bid, i e r(u) and let S(S) be the decreasing sequence of indices 
ordered by the values of Cj. Problem R is easily solved by choosing the first I 
indices in S{8). Let h(S) be the optimal value of problem R for a given 6, that 
is, h(6) — max{g(x) : l T x = I}. When h(5) = 0, then 5 is equal to 5* which is 
the optimal value of problem Q. On the other hand, if h(S) > then 5 < S* , 
and if h(S) < then S > S* , i. e., a root of h (see Figure]!]). The task in solving 
problem Q is to find the value S* for which h(5*) — by some tests on S. The 
key point is how many values of 5 have to be tested for finding 5*. Since the 
optimal solution for R can change only when the order of S(6) changes, we only 
have to test S at intersection values of lines {q = a,; — 6,;<5 : i € r(u)}. Using some 
results from computational geometry, we are able to do this efficiently. First we 




Figure 4: Solving h(S) = Figure 5: Path for I = 3 on H 

compute the line arrangement of lines {q = aij — bid : i € r(w)}. Namely we 
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define the planar graph H which is formed by subdivision of the plane induced 
by these lines. Then we look all edges in H as directed according to the positive 
direction of 5. 

For each / we test values of 5 at the change point of the Ith entry in S(6), 
which are the nodes in H lying on the Ith layer of the arrangement. An example 
with I — 3 is shown in Figure [5] 

We summarize the algorithm for solving P. First compute constants a,, &j, 
for i e r(t>), and line arrangement of lines {q = a, — &,<S : i € T(v)}. Compute 
5(0) by sorting the values of aj. For I = l,.,.,d v , do following steps: let Xi = 1 
if i is within Ith entry of 5(0), and otherwise x^ — 0. Compute A = a T x and 
B = b T x. From the starting edge, which is Zth edge from the top, follow the Z-th 
layer as follows: When we follow the edge on the line c, = dj — &i<5, and visit the 
node which is intersection of c; = ai~bi5 and Cj = aj — bjS, let A <— A — at + cij 
and B 4— B — bi + bj. If A — B5 < then output x as the solution, otherwise 
let Xi <— and Xj <— 1, and go to next node by following the edge on the line 

C 3 = ~ '>,*■ 

Finally, we analyze the running time of the algorithm. Computing the line 
arrangement takes 0(d%) time, which is done by the incremental method or 
topological sort algorithms (see Edelsbrunner [5]). Since the number of nodes 
in H is O(dl) and each of them is visited twice, we can find 5* for each I in 
at most O(d^) time. Therefore this algorithm solves problem P in 0(d%) time. 
Note that if v is not in best response the solution to P gives an improving 
strategy for v. 

As we have seen we can test whether a node v is in best response in 0(d%) 
time. Because G is a tree we have J2vev ^ v = 1), and we have X^ev dv < 
(Eugv^) 2 = 4( n — I) 2 - Therefore we can test whether all nodes are in best 
response in 0(n 2 ) time. This concludes the proof of Theorem 
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3.2 General Graphs 

In this section we give a parametric algorithm for general connected graphs 
based on a parameter k = k{G) defined as follows. If G — (V, E) is a tree we 
set k — 1 else k is the maximum vertex degree in any biconnected component 
of G. Note that k(G) can be computed in 0(|-E|) time by decomposing G into 
its biconnected components and by finding the maximum vertex degree in every 
such component. Note that graphs can have a large maximum vertex degree but 
small parameter k. This would occur whenever the large degree vertices were 
cut vertices. In a network setting the biconnected components could represent 
small groups of well connected web pages with relatively few links per page. 
These groups would be linked together by a few pages containing many more 
links. We prove the following theorem. 

Theorem 3.2 Given a graph G with k = k{G), jump probability a and distri- 
bution q, in 0(2 k n 4 ) time we can determine if this is a Nash Equilibrium in the 
deletion-model and if not give an improving strategy for at least one player. 

The remainder of this section is devoted to the proof of this theorem. 

Let v be a node in G and let {C\, C2, C^} be the set of connected com- 
ponents in the subgraph that is induced by deletion of v from G. It follows 
from the definition of k = k(G) that v has at most k links to Cj for i = 1, d. 
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Let Ui = {u : u £ Ci,u £ r(f)} indicate the set of v's neighbours in Q, for 
i = 1, d. We have \Ui\ < k by the definition of fc. Consider any strategy for 
v, as described in Section [3] and let x be the 0/1 vector of length d v — \T(v)\ 
that represents it. We write x — (x 1 , x 2 , x d ) as the concatenation of the 0/1 
vectors x % representing the strategy restricted to the component Cj, % = 1, ...,d. 
Then, for u £ V, if u £ Ci for i = 1, d, the potential of u is written as follows: 

4> uv {x)= ^ n c 1 -^) ( g ) 

sctr< ses seu v -s 

where ^>„„ are the potentials from u for the subgraph of G formed by deleting 
edges vu,u £ T(v) — S. This is because potentials to v from all nodes in C, 
depend only on links to Ui and never depend on other links. To compute each 
column vector ($)^ for S C J7j for i = 1, we solve the linear systems 
defined on ([lj by Gaussian elimination method in 0(2 k n 3 ) time. 
We have the formula for PageRank of v as 

where as and 65 be constants such that 

as = £ °ft.£(-l) |SHT| & ( 8 ) 

uGd-.scCi res 

, _ / l-(l-a)^ S = {u] 

bs I -(l-«)E ue sETCs(-l) |SHT1 ^ otherwise. |J1 

Note that ag > for all S and the denominator of tt v (x) is always positive. 

In order to determine whether v maximizes its PageRank, consider the fol- 
lowing fractional integer programming problem: 

P : maximize n v (x), x\ £ {0, 1} for s £ Ui, i — 0, ...,d. 

The method for problem P is the similar to that used in Section [3Tj We fix the 
Hamming weight of x as l T x = I, and consider the following fractional integer 
programming problem: 

m ■ ■ tt \ ^ Esct/j a s UseS x l , , 1 t 1 

Q : maximize J\X) = ~ — - — r subject to 1 x = I. 

T,i Escu* b s Uses x l 

Since max tt v (x) = max; I f(x), we can solve problem P by solving problem Q 
for each I — I, d v . Let 5 be a positive real number, and let cs = as — °S<5 for 
all 5 C Ui for i = l,...,d. 

i? : maximize 5(2:) = cs 2^ subject to l T a; = I, 

i SCUi ses 

Let h(6) be the optimal value of problem R for some S, i.e., /i(<5) = max x {<?(:c) : 
l T x = I}. Note that we do not have to find the optimum value S* of problem 
Q, since our goal is to determine whether v maximizes its value, that is, I S* > 
d v f(l) or not. All we have to do is to solve R for 5' — ^/(l), and determine 
whether h(S') < or not. 
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For converting g(x) in problem R to linear function, let y^t denote the 0/1 
variable and let e^t be the constant for 1 < i < d and 1 <t< \Ui\ such that, 

k {TCS 

yi : t indicates whether the number of edges used in the strategy x going from v 
to Ui is equal to t or not. If y^t = 1, let S C Ui be the t edges chosen. Then 
we consider that g{x) earns X)tcs c Ti with cost t. In the optimal strategy x, 
if Ui,t = 1 for some i, t then it must be that S maximizes X^tcs Ct ' that is e^t, 
as any other assignment can be improved to it. We then have the equivalent 
integer linear program to R: 

d \Ui\ d \u,\ 

R' : maximize e^t y^t subject to t yi.t = I (10) 

i=l t=l i=l t=l 

2l«,t < 1 for < = l 1 ... J d. (11) 
t=i 

Problem i?' is similar to a knapsack problem where each item has positive integer 
weight t and value en, and the total weight must be I. The only difference is 



the constraint (111 



Dynamic programming can be used to solve R! in 0(ld v ) time. Let w(i,t), 
for 1 < i < d and 1 < t < i, denote the maximum value which has total weight 
i and uses only the i first items. Let e^o = 0, then, 

w(i,t) = max — 1,4 — s) + s }. 

0<s<\Ui\ 

For each i = 1, we can compute w(i,t) for 1 < t < I in i|J7j| time. Since 
d v = X)i<i<d t ne computation time for solving R' is 0(ld v ). 

In order to determine whether u is in best response, we test g{x) for 8 = 
^-6', ^6'. Since d v < kd the running time per vertex is 0(2 k n 3 + k 2 d 3 + 
2 k d). Moreover ^2, ve yd v < 2kn from the assumption. Therefore, in order to 
determine whether every node is in best response, it takes Q(2 k n i + k 2 n 3 + 



2 ri) = 0{2 n ) time. This completes the proof of Theorem 3.2 



4 Request-Delete Model 

In this section, we study the request-delete-model for undirected PageRank 
games, where not only a player v can unilaterally delete bidirected links, but 
also v can create outlinks from v to any non- neighbours. We may consider 
these outlinks to be a form of requests to the other node to establish a link. 
We consider the problem of determining whether the given input is a Nash 
equilibrium in the request-delete-model or not. We give an 0(n 3 ) time algorithm 
for this when the underlying graph G is a bidirected tree. 

Let G be a tree and v be a node in G. Let T(v) denote the set of neighbours 
of v and let d v = \T(v)\. For i £ r(u), let iV, be the set of nodes which 
are descendants of i (including i itself) in the subtree of G rooted at v. In 
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the request-delete-model, a player v is in best response if any combination of 
edge deletions and creation of outlinks cannot increase PageRank of v. A Nash 
equilibrium is a graph for which every player is in best response. A strategy 
for v is to retain a subset E v C T(v) of neighbours and to choose a subset 
F v C V — T(v) for outlinks to her non- neighbours. Let x and y be a 0/1 vectors 
of length d v = \T(v)\ and of length \V — T(v)\ = n — d v respectively which 
indicate u's strategy. Formally, for i G r(w), Xi = 1 if i is in £"„ otherwise 
cc, = 0. Similarly, for u £ V — T(v), y u = 1 if it is in F„ otherwise y„ = 0. Let 
iT v (x,y) be the PageRank of u on the resulting graph for the strategy E v and 
F v . A node v is in best response if 71^,(1,0) > ir v (x,y) for any 0/1 vectors x 
and y. Our approach to solve the problem for verifying a Nash equilibrium is 
to compute the maximum of 7r v (x,y) over all 0/1 vectors x,y, for each vertex 
v. 

Since G is a tree and the outlinks of v do not affect <j) uv for any u G V, 
equation Q holds. By equation ([2]), the PageRank of v can be written as: 

n v ( X , y ) = (l T x + l T y) E ' er <"> a ' Xt Y . (12) 

where we let a = (a^), c = (cj) and e = (e u ) be constants such that 
di = a ^] q u (j>uv, <H = (1 - a)^ it) for i G T(w). 

e„ = (l — a)4>uv tor u -Y(v). 

The proof of the following lemma can be found in [5]. 

Lemma 4.1 Le< D be a directed graph. For a node v in D, if a node u =/= v 
has the maximum potential with respect to v then u is a in-neighbour of v. 

We prove following two lemmas. 

Lemma 4.2 Let G be a tree, v be a node in G, and let Ni be defined as above. 
For all i G T(v) and for all u G N(i) — {i}, 

Proof. Let T be the subtree of G induced by v and all nodes in Ni. Since G is 
a tree, the walk starting at the nodes in Ni cannot reach the nodes in V — N 
without visiting v. Thus the potential from each vertex u G T with respect to 



v is the same as the potential in G. By Lemma 4.1 only a neighbour of v can 
have the maximum potential to v in T and the other potentials are strictly less. 
This means 4>i V > <f) uv for all u G N(i) in G and concludes the proof. | 



By Lemma |4.2[ we have the following strict inequality. 

Cj > e u for u G Ni, for i G T(v). (13) 

Let y l denote a 0/1 vector over V — T(v) such that y l u — 1 if e u is within 
the Ith largest values in the all entries of e, and y u — otherwise. 
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Lemma 4.3 Let Tr v (x,y) and y l be defined as above. For I — 1, ...,n, 
max { 7r v (x,y) \ l T x + l T y = I } 



x.y 



max { n v (x, y l2 ) \ l T x = h, h + fa = I }• 



l<h<dv 
0<l 2 <n-d v 

Proof. By contradiction. When y = and y = 1, obviously the above equation 
holds, so we consider the case of y ^ 0, 1. 

Assume that (:c*,y*) is the maximum assignment for ir v (x,y) subject to 
l T x + l T y = I, and that y* ^ y 1 " 2 for any 1 < fa < n.— d v — 1. There exist it, w 
in y — r(w) such that e u > e w and = 0, y^ — 1. Let i denote the neighbour 
of v such that u is in Nj. 

When the case of x* = 1, the assignment y* = 1, y* u = decreases the de- 



nominator in the equation ( 12 ), since 1— e u < 1 — e w . This gives an improvement 
for n v (x, y), and does not change the Hamming weights of x and y, a contra- 
diction. Therefore we take x* = 0. However the assignment x* = 1, ?/*„ = 
dec reas es the denominator in the equation ( |12[ ) , since 1 — c, < 1 — e u < 1 — e w 
by ( 13 ). This gives an improvement, and does not change the value l T x + l T y. 



This contradiction concludes the proof. | 



Lemma 4.3 means that if a player v is in best response, v puts her outlinks 



to the nodes which have the higher potential to v. 

Theorem 4.4 Given a bidirected tree G, jump probability a and distribution 
q, we can determine in 0(n 3 ) time whether this is a Nash equilibrium in the 
request- delete-model and if not give an improving strategy for at least one player. 

Proof. Consider the following fractional integer programming problem: 

P' : maximize n v (x,y) x G {0,l} d ", ye{0,l} n ~ d ". 



We solve optimization problem P' for each v in V. For given v and each I 
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0, ...,n — d v , we compute y 2 as defined just before Lemma 4.3 Consider the 
following 0/1 optimization problem Q' for l\ = 1, ...,d v . 

x 

Q' : maximize f(x)= —= — . subject to l T x = l\ 

b 1 x 

where b % = ((1 - a) + EneA^-M (1 - eu)yi 2 ) for i G T(v). By using the 
algorithm to solve the problem Q in Section |3.1[ we can s olve problem Q' for 
all li = 1, d v in 0{d%) time. Let I = li + fa- By Lemma 4.3 the solution to 
problem Q' for each I = 1, ...,n gives also the solution to P since ir v (x,y l2 ) — 
(h + h)f(x). Thus, for each node v in V, we can determine whether v is in 
best response. It follows that we can determine whether the input is a Nash 
equilibrium. 

Finally, we analyze the running time of the algorithm. It takes 0{n 2 ) time 
to compute all potentials in G. For each l 2 = 0, 1, ...,n — d v , it takes 0(d 2 v ) 
times to solve the problem Q' . Therefore we can determine if a node v is in best 
response in O ((n — d v ) d%) time. Since G is a tree, J2 veV d v = 0(n). Thus we 
can test whether all nodes are in best response in 0(n 3 ) time. | 
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5 Add-Delete Model 



In this section, we introduce the add-delete-model, where each player v can 
delete any edges from v and can add one edge to any non-neighbour u if by so 
doing the PageRank of u increases. Otherwise we may presume that u would 
simply delete the edge (u,v). We consider the problem of determining whether 
or not the input is a Nash equilibrium and give an 0(2 2k n 5 ) time algorithm 
for general graphs, where k is the maximum vertex degree on any biconnected 
component in the graph. 

Let G — (V,E) be an undirected graph and let k = k(G) be a parameter 
defined as follows. If G = (V,E) is a tree we set k = 1 else k is the maximum 
vertex degree in any biconnected component of G. Let T(v) denote the set of 
neighbors of a node v in G. Let d v = \T(v)\. In the add-delete-model, the 
strategy for a player v is to retain the subset of neighbours from T(v) and to 
choose one non-neighbour u to add an edge between u and v. The PageRank 
of u must increase by the strategy. A player v is in best response if v cannot 
increase her PageRank by her any other possible strategies. A Nash equilibrium 
is a graph where every player is in best response. 

Let v be a node in G and let {C\, C2, Cd} be the set of connected compo- 
nents in the subgraph that are induced by deletion of v from G. It follows from 
the definition of k = k{G) that v has at most k links to C, for i = 1, ...,d. Let 
Ui = {u : u E Ci,u 6 r(i>)} be the set of v's neighbours in Ci for i = l,...,d. 
Let a; be a 0/1 vector of length \T(v)\ which indicates the strategy of v for edge 
deletion. We write x = (at 1 , x 2 , x ) as the concatenation of the 0/1 vectors 
x l representing the strategy restricted to the component Ci for i = 1, d. The 
potential (f> wv (x) on G is given by equation (|6| for each w in V. 

Theorem 5.1 Given a graph G with k = k(G), jump probability a and dis- 
tribution q, in 0(2 2k n 5 ) time we can determine if this is a Nash Equilibrium 
in the add-delete-model and if not give an improving strategy for at least one 
player. 

Proof. We give an algorithm which determines whether a player v in best 
response, for each vertex v in V. 

We fix v and for each of v's non-neighbours u perform the following steps. Let 
Cj be the component containing u. We initially decide the strategy vector x J for 
Cj by choosing a subset t/j C Uj. Let G' be the graph formed by adding («, v) to 
G and by deleting the edges to nodes not in C/j . Otherwise we retain the edges to 
the nodes in Ui, for each i = 1, ...d, i 7^ j. Let x~i = (x 1 , a;- 7-1 , x^ +1 , x d ). 
For each node w in Cj, let <fi® v be the potential calculated on G' . Since v is 
a cut vertex of G' , these potentials are invariant to x~i . By equation v's 
PageRank on the resulting graph for the strategy a:™- 7 on G' can be written as 
follows: 

*r (*-') = (iv* + m ^l^^i 8 ^: (u) 

b o + Ei^j Zscc/, bs Uses x s 
where as, bs are defined by equation (Jsj) and (fsj) , and a = aEt«ec, Iw^wv an d 

problem. 



bo = \Uj \ — (1 — a) Ylweu'. ^wv are known constants. We consider the following 



P" : maximize ir^ (x ■') subject to 7r„ (x J ) > 7T, 



G 
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In order to determine whether v maximizes its PageRank, we fix the Hamming 
weight I of x~i for each I = 1, |r(u)| — \U'j\ and consider the following problem 
Q". 

r\" ■ ■ a ° + Si/j ^SdUi a S Uses x% s , . , , ,t -j ; /1 K \ 

Q : maximize ^ ~ — - — = subject to 1 x J = I (15) 

fro + Ei#j T,SCU ( b s Us€S x l 

Trf(x-J) > vr.^16) 
According to the PageRank formulae for each w in Cj, 

n G '(x-i)-aa + V 

7r tt ^ j - a 9u , -t- 2^ irrii/ii ' 

Since the nodes in Cj are not adjacent to the nodes in C{ for each i = 1, 
i j, we can calculate the PageRank of u as follows. 

*f(x-J)=C + *. 



1^ + |^| 



where and are positive constants. Thus the constraint ( 16 1 in the problem 



Q" is 7r^ (x J ) > ' + t) (tt„ — C«)- Since our goal is to maximize (x J ), we 



can ignore the constraint ( 16 1 during the maximization. Let 5* be the optimal 



value of problem Q" without the constraint ( 16 1 . Player v is not in best response 

if tt£/(1 + \UI\) < S* and l - ± ^-{ir2 ' -( u ) < 5*. We can test these inequalities by 
the algorithm for problem Q in Section |3.2| This means that we can determine 
if v is in best response or not. 

Finally, we analyze the running time of the algorithm. For each non-neighbour 
u of v and for each subset [/j of Uj, it takes 0(n 3 ) time to compute the po- 
tential <j)® v for each w in Cj. It takes 0(2 k n 3 ) time for problem Q" for each 
I = 1, |r(t?)| — \Uj\. Therefore it takes 0(2 2k n 4 ) time per vertex and so we 
can determine if the input is a Nash equilibrium or not in 0(2 2fc n 5 ) time. | 



5.1 a-insensitive equilibria 

We prove that G is an a-insensitive equilibrium in the add-delete-model if and 
only if G is a complete graph using the following lemma. 

Lemma 5.2 If an undirected graph G is connected, for non-neighbours u,V in 
V there exists a® such that for all < a < Uq, 

\ G' ^ G 

where G' = (V,EU (u,v)). 

Proof. Recall that the stationary distribution in a standard random walk on a 
connected undirected graph is proportional to the degree of each vertex. In an 
a-random walk, when we have a — > 0, tt g converges to d v /d' and tt g converges 
to (d v + l)/(d' + 2) > d v /d', where d! — Ylvev d v and so d! > 2d v . By the 
continuity of n v for a 6 (0,1) [3], there exists ao such that ir G > ir^ and 
n u > n u f° r all < a < a - This concludes the proof. | 

An undirected graph G is called an a-insensitive equilibrium if G is an 
equilibrium for all < a < 1. 
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Theorem 5.3 If an undirected graph G is connected, G is an a-insensitive 
equilibrium in the add-delete-model if and only if G is a complete graph. 



Assume that G is an a-insensitive equilibrium and is not a complete 
and thus G has at least one non-neighbours of (u,v). By Lemma 5.2 



Proof. 
graph, 

there exists ao such that for a < olq, the edge addition [u, v) must increase the 
PageRank of both 7r u and tt v . This contradicts that G is an equilibrium. 

Conversely, assume that G is a complete graph. There does not exist a 
non-neighbour in G. G is a Nash equilibrium in the directed PageRank game 



since each node is in best response (see Section 2.2). Therefore any deletion of 
undirected edges for a node v does not improve her PageRank for any a. This 
means G is an a-insensitive equilibrium in the add-delete-model. | 

One may wonder if there are a-sensitive equilibrium in the add-delete-model 
which are connected and are not complete graphs. We do not know the answer 
to this question even if we assume q is a uniform distribution. However, we 
have found some a-sensitive equilibria in the add-delete-model without deletions 
allowed. That is, we allow single edge additions that raise the PageRank of 
both endpoints but we do not allow deletions. In the case of non-uniform q we 
have found many examples of a-sensitive equilibria. For uniform q we have the 
following examples. 





G, G 2 
Figure 6: a-sensitive equilibria: add-delete-without deletions, uniform q 



Figure [6] shows two graphs G\ and G2 which are examples of a-sensitive 
equilibria in the no-deletion add-delete-model with uniform q, where K m denotes 
the complete graph of size m. In Gi, there are three possible choices for an edge 
addition which are: (i) 1,3, (ii) 1,4 and (iii) 2,4. For example, take mi = 50, 
m 2 = 2 and a = 0.15. Case (i) decreases PageRank of node 1, case (ii) decreases 
PageRank of node 1 and case (iii) decreases PageRank of node 2. Therefore G\ is 
an equilibrium for a = 0.15. While we take a = 0.02, any of these cases increase 
the PageRanks of the two endpoints. Therefore G\ is not an equilibrium for 
a = 0.02. It follows that G\ is an a-sensitive equilibrium. The graph G2 is also 
an a-sensitive equilibrium for mi = 10000, m2 — 100, 7713 = 2 and a = 0.15. We 
remark that these are not equilibria if deletions are allowed. In Gi the deletion 
of (1, 3) improves the PageRank of node 3. Similarly in G3, the deletion of (3, 4) 
improves the PageRank of node 4. 

5.2 A Sufficient Condition for Edge Addition 

Throughout this section, we assume that the distribution vector q is uniform, 
i.e., q v = 1/n for all v € V. We show that if G is symmetric with respect to 
non-neighbours u, v in G, then an edge addition between it, v must increase the 



16 



PagcRank of both u and v. An automorphism mapping on G is a permutation 
a over the vertices such that (a(u),a(v)) e E if and only if (u,v) e £7. Let 
Aut(G) be a set of automorphisms on G. 

Theorem 5.4 If an undirected graph G has non-neighbours u, v and has a graph 
automorphism a £ Aut(G) such that a(v) = u and a(u) = v, then 

where G' = (V,EU (u,v)). 

Proof. Let Zij be the expected number of times in which an a-random walk 
visits j before the first jump when the walk starts at i. 



= E 



,7-1 



t=0 



X = i 



where J > 1 is the time of the first jump and I{X t — j} denotes the indicator 
for the tth step of a-random walk visiting j. Since the mapping a preserves 
adjacency of all neighbours in G, we have — 2 CT (iW(j) for all i,j <G V. Since 
q is the uniform distribution, 



- -Vz 

n ^ — ' 



wEV 

- y 

n z <r(w),<r(v) 

wev 



a \ -> 
n ^ 



Kiev 

Similarly, (<r (u) , er (w)) = (w, u) is in G' , so the mapping cr is in Aut(G'). Hence 
we have 7r^ = 7r^ . 

For a node w in V, let denote the expected number of times in which 

the walk visits u and v when the walk starts at w before a random jump. By 
the linearity of expectations, 



E 



.7-1 



= E 



J2(I{Xt = u} + I{X t = v}) 

t=0 

J2l{X t = u} 



. t=0 







X = w 


+ E 



X = w 
j-i 



X = w 



. t=0 

Let nP , denote the sum of PagcRank of u and v on G. Then we have, 



ttG , G 



+ z„ 
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We compute the expected number of visits to u and v when the walk starts 
at w. Firstly we compute the probability that the walk reaches u or v and 
consider the expected number of walks revisiting u and v. We have z w \ uv ] — 
4>w[uv] z [uv] [uv] , where <fi w [ uv ] is the probability that a walk starting from w reaches 
u or v before the first jump. The term zr ut ,][ u „] counts the number of returns 
from uUv to uUv before the first jump occurs. 

Xq =uU Xq = v . 

Each return succeeds with probability <bt „ ,, where df , is the probability 

* J T [uv] [uv] 3 ~|UUj|UUj r J 

of the walk returning from u U v to u U v before the first jump. Hence we have 

Z[uv][uv] = 1 + <t>[ uv ]l uv ] + (0[Ll)][ui)]) 2 ^ = ^ _ 7 + • 

'[uv] [uv] 

Since we have a(v) — u, cr(u) — v and q v = q u = 1/n, 



" j-i 



(1 



|r(«) 



ier(v) 



(17) 



Now we conclude the theorem. If we add a new edge between u and v to 



have 



G, the right-hand in (17) becomes strictly larger than the before, because we 
By equation) 17 1, this means that ttP , > 7r£ 



= 1. 



= 27T„ = 2ir, 



[uv] 



We also have 



This concludes the proof. 



6 Conclusion 

We have constructed three different models for PageRank games on undirected 
web graphs and studied the problem of verifying Nash equilibria in each model. 
The algorithms obtained have high complexity and are thus useable only on 
small scale networks, or on a small fragment of a much larger network as an 
approximation. From the algorithmic side, the main open question is whether 
polynomial time algorithms exist for checking Nash equilibria in general graphs. 

Other results in this paper concern the evolution of PageRank graphs if 
players act unilaterally to improve their page rank. We showed in Theorem 
|5.3| that complete graphs are the only a-insensitive equilibria in the add-delete- 
model. It would clearly spell doom for any social network if this happened! Are 
there any non-a-insensitive equilibria? 
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